# Multi-task Fine-tuning

## Tooka SBERT V2 Large

*PartAI · Text Embedding · 127 downloads · 1 like*

A semantic text similarity and embedding model specifically designed for Persian, capable of mapping sentences into a dense vector space where semantically similar texts are positioned close to each other.
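As a rough illustration of how a sentence-embedding model like this is typically used, here is a minimal sketch with the sentence-transformers library; the repo id `PartAI/Tooka-SBERT-V2-Large` is assumed from the card name and publisher and may differ.

```python
from sentence_transformers import SentenceTransformer, util

# Assumed repo id, reconstructed from the card name and publisher; verify before use.
model = SentenceTransformer("PartAI/Tooka-SBERT-V2-Large")

sentences = [
    "امروز هوا آفتابی است",          # "The weather is sunny today"
    "هوای امروز خوب و آفتابی است",   # "Today's weather is nice and sunny"
    "قیمت طلا کاهش یافت",            # "The price of gold fell"
]
embeddings = model.encode(sentences)

# Cosine similarity matrix: the first two sentences should score much higher
# against each other than against the third.
print(util.cos_sim(embeddings, embeddings))
```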
## Olmo 2 0425 1B SFT

*allenai · Apache-2.0 · Large Language Model · Transformers · English · 1,759 downloads · 2 likes*

OLMo 2 1B SFT is a supervised fine-tuned version of the OLMo-2-0425-1B model, trained on the Tulu 3 dataset and designed to achieve state-of-the-art performance across multiple tasks.
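For orientation, a minimal text-generation sketch with transformers follows; the repo id `allenai/OLMo-2-0425-1B-SFT` is assumed from the card name, and OLMo 2 checkpoints need a reasonably recent transformers release.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id based on the card name; OLMo 2 support requires a recent transformers version.
model_id = "allenai/OLMo-2-0425-1B-SFT"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

# SFT checkpoints are chat-tuned, so format the prompt with the model's chat template.
messages = [{"role": "user", "content": "Explain supervised fine-tuning in one sentence."}]
input_ids = tokenizer.apply_chat_template(messages, add_generation_prompt=True, return_tensors="pt")

output = model.generate(input_ids, max_new_tokens=64)
print(tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True))
```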
## DPO A5 Nlp

*EraCoding · Large Language Model · Transformers · 26 downloads · 1 like*

TRL is a reinforcement learning library for training and fine-tuning Transformer-based language models.
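Since the card only describes TRL in general terms, here is a minimal, hedged sketch of direct preference optimization (DPO) with TRL; the base model id, the tiny inline preference dataset, and the training arguments are placeholders, and argument names (e.g. `processing_class`) can differ between TRL releases.

```python
from datasets import Dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

# Placeholder base model; any small causal LM works for a smoke test.
model_name = "Qwen/Qwen2.5-0.5B-Instruct"
model = AutoModelForCausalLM.from_pretrained(model_name)
tokenizer = AutoTokenizer.from_pretrained(model_name)

# DPO expects preference pairs: a prompt plus a preferred ("chosen")
# and a dispreferred ("rejected") response.
train_dataset = Dataset.from_dict({
    "prompt": ["What is the capital of France?"],
    "chosen": ["The capital of France is Paris."],
    "rejected": ["France does not have a capital city."],
})

args = DPOConfig(output_dir="dpo-example", per_device_train_batch_size=1, max_steps=5)
trainer = DPOTrainer(model=model, args=args, train_dataset=train_dataset, processing_class=tokenizer)
trainer.train()
```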
## Paligemma2 3b Mix 448

*google · Image-to-Text · Transformers · 20.55k downloads · 44 likes*

PaliGemma 2 is a vision-language model based on Gemma 2, supporting image and text inputs with text generation output, suitable for various vision-language tasks.
## Camembertav2 Base

*almanach · MIT · Large Language Model · Transformers · French · 2,972 downloads · 19 likes*

CamemBERTav2 is a French language model pretrained on 275 billion French text tokens, utilizing the DeBERTaV2 architecture, and excels in multiple French NLP tasks.
## Phico D Instruck

*acecalisto3 · MIT · Large Language Model · Transformers · Supports Multiple Languages · 19 downloads · 2 likes*

An instruction-following model fine-tuned from T5-base, designed to understand and execute complex instructions.
## Paligemma 3b Ft Refcoco Seg 896

*google · Image-to-Text · Transformers · 20 downloads · 6 likes*

PaliGemma is a lightweight vision-language model developed by Google, built upon the SigLIP vision model and the Gemma language model, supporting multilingual text generation and visual understanding tasks.
## Tookabert Base

*PartAI · Apache-2.0 · Large Language Model · Transformers · Other · 127 downloads · 24 likes*

TookaBERT is a family of encoder models trained on Persian, including base and large versions, suitable for various natural language processing tasks.
## Openelm 3B

*apple · Large Language Model · Transformers · 1,436 downloads · 123 likes*

OpenELM is a set of open-source efficient language models that employ a hierarchical scaling strategy to optimize parameter allocation and improve model accuracy. It includes four parameter scales: 270M, 450M, 1.1B, and 3B, offering both pre-trained and instruction-tuned versions.
## Hyperion 2.0 Mistral 7B

*Locutusque · Apache-2.0 · Large Language Model · Transformers · Supports Multiple Languages · 16 downloads · 6 likes*

A multi-domain language model fine-tuned on the Hyperion-v2.0 dataset, excelling in scientific reasoning and complex task processing.
## Kafkalm 70B German V0.1

*seedboxai · Large Language Model · Transformers · German · 159 downloads · 18 likes*

A large German language model based on Llama2 70B, specialized for German business scenarios.
## H2o Danube 1.8b Base

*h2oai · Apache-2.0 · Large Language Model · Transformers · English · 281 downloads · 43 likes*

A 1.8B-parameter base language model trained by H2O.ai, based on an improved Llama 2 architecture with 16K context length support.
## Mental Alpaca

*NEU-HAI · Large Language Model · Transformers · English · 180 downloads · 9 likes*

This is a fine-tuned large language model for mental health prediction using online text data.
## Ziya LLaMA 13B V1

*IDEA-CCNL · GPL-3.0 · Large Language Model · Transformers · Supports Multiple Languages · 219 downloads · 275 likes*

A 13-billion-parameter pre-trained model based on the LLaMA architecture, capable of translation, programming, text classification, information extraction, summarization, copywriting, commonsense Q&A, and mathematical calculations.
## All Mpnet Base V2

*diptanuc · Apache-2.0 · Text Embedding · English · 138 downloads · 1 like*

A sentence embedding model based on the MPNet architecture that maps text to a 768-dimensional vector space, suitable for semantic search and text similarity tasks.
## Glm 2b

*THUDM · Large Language Model · Transformers · English · 60 downloads · 16 likes*

GLM-2B is a general-purpose language model pre-trained with autoregressive blank filling objectives, supporting various natural language understanding and generation tasks.
## Deberta Base Combined Squad1 Aqa Newsqa And Newsqa

*stevemobs · MIT · Question Answering System · Transformers · 15 downloads · 0 likes*

A question-answering model based on the DeBERTa architecture, fine-tuned on the SQuAD1, AQA, and NewsQA datasets.
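As a generic illustration of how an extractive QA checkpoint like this is used, here is a minimal sketch with the transformers question-answering pipeline; the repo id below is a guess reconstructed from the card name and should be verified.

```python
from transformers import pipeline

# The repo id is a guess based on the card name; substitute the actual checkpoint.
qa = pipeline(
    "question-answering",
    model="stevemobs/deberta-base-combined-squad1-aqa-newsqa-and-newsqa",
)

result = qa(
    question="Where was the agreement signed?",
    context="After months of negotiation, the agreement was signed in Geneva in 1949.",
)
print(result["answer"], result["score"])
```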
## Erlangshen MegatronBert 1.3B Sentiment

*IDEA-CCNL · Apache-2.0 · Text Classification · Transformers · Chinese · 93 downloads · 20 likes*

A Chinese sentiment analysis model based on the MegatronBert architecture, fine-tuned on multiple sentiment analysis tasks.
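A minimal usage sketch with the transformers text-classification pipeline follows; the repo id is assumed from the card name, and at roughly 1.3B parameters the checkpoint is large enough that a GPU is advisable.

```python
from transformers import pipeline

# Assumed repo id based on the card name; the checkpoint has ~1.3B parameters.
classifier = pipeline(
    "text-classification",
    model="IDEA-CCNL/Erlangshen-MegatronBert-1.3B-Sentiment",
)

# "The service at this restaurant was excellent" / "The movie was really boring"
print(classifier(["这家餐厅的服务非常好", "这部电影真的很无聊"]))
```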
## Erlangshen Roberta 330M NLI

*IDEA-CCNL · Apache-2.0 · Text Classification · Transformers · Chinese · 468 downloads · 5 likes*

A version of the Chinese RoBERTa-wwm-ext-large model fine-tuned on multiple natural language inference datasets.
## Distilbert Base Uncased Squad2 With Ner With Neg With Multi With Repeat

*andi611 · Question Answering System · Transformers · 20 downloads · 0 likes*

A QA and NER model based on distilbert-base-uncased-squad2, fine-tuned on the CoNLL-2003 dataset.
## Tapas Large Finetuned Wtq

*google · Apache-2.0 · Question Answering System · Transformers · English · 124.85k downloads · 141 likes*

TAPAS is a table question answering model based on the BERT architecture, pre-trained in a self-supervised manner on Wikipedia table data and supporting natural language question answering over table content.
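The table-question-answering pipeline in transformers is the usual way to run TAPAS checkpoints; a minimal sketch is shown below (`google/tapas-large-finetuned-wtq` matches the card name, pandas is required, and older transformers versions additionally needed torch-scatter for TAPAS).

```python
import pandas as pd
from transformers import pipeline

# TAPAS pipelines expect the table as a DataFrame (or dict) with string-valued cells.
table_qa = pipeline("table-question-answering", model="google/tapas-large-finetuned-wtq")

table = pd.DataFrame({
    "City": ["Paris", "Berlin", "Madrid"],
    "Population (millions)": ["2.1", "3.6", "3.3"],
})

result = table_qa(table=table, query="Which city has the largest population?")
print(result["answer"])
```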
## Robbert V2 Dutch Ner

*pdelobelle · MIT · Large Language Model · Other · 76.94k downloads · 3 likes*

RobBERT is the state-of-the-art Dutch BERT model, pretrained on a large scale and adaptable to various text tasks through fine-tuning.
## Data2vec Nlp Base

*edugp · Apache-2.0 · Large Language Model · Transformers · 14 downloads · 0 likes*

Data2Vec NLP Base is a natural language processing model converted from the fairseq framework, suitable for tasks such as text classification.
## Sew D Mid 400k Ft Ls100h

*asapp · Apache-2.0 · Speech Recognition · Transformers · English · 20 downloads · 1 like*

SEW-D-mid is a speech pre-training model developed by ASAPP Research, focusing on automatic speech recognition tasks and achieving a good balance between performance and efficiency.
## Tapas Medium Finetuned Wikisql Supervised

*google · Apache-2.0 · Question Answering System · Transformers · English · 19 downloads · 0 likes*

TAPAS is a Transformer-based table question answering model, pre-trained in a self-supervised manner on English Wikipedia table data and fine-tuned with supervision on the WikiSQL dataset.
## T5 V1 1 Large

*google · Apache-2.0 · Large Language Model · English · 111.29k downloads · 17 likes*

T5 1.1 is Google's improved text-to-text transfer model, utilizing the GEGLU activation function and an optimized architecture, and focusing on unsupervised pre-training.
## Bertino

*indigo-ai · MIT · Large Language Model · Transformers · Other · 64 downloads · 16 likes*

A lightweight DistilBERT model pre-trained on a large-scale Italian corpus, suitable for various natural language processing tasks.
## Tapas Base Finetuned Wikisql Supervised

*google · Apache-2.0 · Question Answering System · Transformers · English · 737 downloads · 9 likes*

TAPAS is a BERT-based Transformer model specifically designed for table question answering tasks. It is pre-trained in a self-supervised manner on English Wikipedia table data and supports weakly supervised table parsing.
## S Biomed Roberta Snli Multinli Stsb

*pritamdeka · Text Embedding · Transformers · 270 downloads · 4 likes*

This is a sentence-transformers model based on allenai/biomed_roberta_base, specifically fine-tuned for sentence similarity tasks and capable of mapping text to a 768-dimensional vector space.
## Bart Base

*facebook · Apache-2.0 · Large Language Model · English · 2.1M downloads · 183 likes*

BART is a Transformer model combining a bidirectional encoder and an autoregressive decoder, suitable for text generation and understanding tasks.
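Because the base checkpoint is not tuned for any downstream task, it is typically used for feature extraction or as a starting point for fine-tuning; a minimal encoding sketch with transformers is shown below.

```python
from transformers import BartTokenizer, BartModel

# facebook/bart-base is the pre-trained base checkpoint, most useful for
# feature extraction or as a starting point for task-specific fine-tuning.
tokenizer = BartTokenizer.from_pretrained("facebook/bart-base")
model = BartModel.from_pretrained("facebook/bart-base")

inputs = tokenizer(
    "BART combines a bidirectional encoder with an autoregressive decoder.",
    return_tensors="pt",
)
outputs = model(**inputs)

# Token-level representations from the model's final layer.
print(outputs.last_hidden_state.shape)
```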
## Minilm L12 H384 Uncased

*microsoft · MIT · Large Language Model · 10.19k downloads · 89 likes*

MiniLM is a compact and efficient pre-trained language model, compressed via deep self-attention distillation, suitable for language understanding and generation tasks.
## Electra Large Generator

*google · Apache-2.0 · Large Language Model · English · 473 downloads · 8 likes*

ELECTRA is an efficient self-supervised language representation learning method that replaces traditional generative pretraining with discriminative pretraining, significantly improving computational efficiency.
## Tapas Large Finetuned Wikisql Supervised

*google · Apache-2.0 · Question Answering System · Transformers · English · 80 downloads · 6 likes*

TAPAS is a BERT-like Transformer model designed for table-based question answering tasks. It is pre-trained in a self-supervised manner on English Wikipedia table corpora and fine-tuned on the WikiSQL dataset.
## Tapas Small Finetuned Wtq

*google · Apache-2.0 · Question Answering System · Transformers · English · 406 downloads · 5 likes*

This model is a small version of TAPAS, fine-tuned on the WikiTableQuestions dataset for table-based question answering tasks.
## Tapas Medium Finetuned Wtq

*google · Apache-2.0 · Question Answering System · Transformers · English · 77 downloads · 2 likes*

A medium-sized table question answering model based on the TAPAS architecture, fine-tuned on the WikiTableQuestions dataset and suitable for table QA tasks.
## Albert Fa Base V2

*m3hrdadfi · Apache-2.0 · Large Language Model · Transformers · Other · 43 downloads · 4 likes*

A lightweight BERT model for self-supervised learning of Persian language representations.
## Tapas Tiny Finetuned Wtq

*google · Apache-2.0 · Question Answering System · Transformers · English · 1,894 downloads · 1 like*

TAPAS is a tiny Transformer model optimized for table question answering, achieving table comprehension through intermediate pretraining and chained multi-dataset fine-tuning.
## Mt5 Small Jaquad Qg Ae

*lmqg · Question Answering System · Transformers · Japanese · 143 downloads · 1 like*

A Japanese question generation and answer extraction model fine-tuned from MT5-small, able to generate questions or extract answers from a given text.
## Wangchanberta Base Wiki Newmm

*airesearch · Large Language Model · Other · 115 downloads · 2 likes*

A RoBERTa BASE model pretrained on Thai Wikipedia, suitable for Thai text processing tasks.
## Tapas Base Finetuned Wtq

*google · Apache-2.0 · Question Answering System · Transformers · English · 23.03k downloads · 217 likes*

TAPAS is a Transformer-based table question answering model, pre-trained on Wikipedia table data through self-supervised learning and fine-tuned on datasets like WTQ.